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Various specific human SP-18 and human SP-5 derived peptides have alveolar surfactant protein (ASP) activity. These 
peptides are prepared using synthetic methods or by recombinant techniques. 
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ALVEOLAR SURFACTANT PROTEINS 



Technical Field 

The invention relates generally to alveolar 
surfactant proteins (ASP) which are useful in the manage- 
ment of certain respiratory diseases . 

Background Art 

The human lung is composed of a large number of 
small sacs or alveoli in which gases are exchanged between 
the blood and the air spaces of the lung- In healthy 
individuals, this exchange is mediated by the presence of 
a protein-containing surfactant complex which is 
synthesized in the microsomal membranes of type II 
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alveolar cells. In the absence of adequate levels of this 
complex, a lung cannot properly function — i.e., the 
alveoli collapse during exhalation, and cannot be 
subsequently re-inflated by inhaling. Thus, the untreated 
inability to synthesize this complex may result in death 
or in severe physical damage if left untreated. 

The best documented instance of inadequate 
surfactant complex levels occurs in premature infants and 
infants born after complicated pregnancies, and is widely 
known as respiratory distress syndrome (RDS). A widely 
publicized form of this syndrome has been designated 
hyaline membrane disease, or idiopathic RDS. RDS is cur- 
rently the leading cause of infant mortality and morbidity 
in the United States and in other developed countries, and 
substantial efforts have been directed to diagnosis and 
treatment. Current treatment has focused on mechanical 
(pressure) ventilation which, at best, is an invasive 
stop-gap measure that often results in damage to the lung 
and other deleterious side effects, including complica- 
tions such as bronchopulmonary dysplasia, interstitial 
emphysema and pneumothorax. Mental retardation has. also 
resulted on occasion when this treatment was used 
(Hallman, M. , et al. Pediatric Clinics of North America 
(1982) 22:1057-1075). 

Limited attempts have been made to treat the 
syndrome by surfactant substitution. This would be a 
method of choice, as, in general, only one administration 
is required, and the potential for damage is reduced. For 
example, Fujiwara, et al. Lancet (1980) l:55-used a 
protein-depleted surfactant preparation derived from 
bovine lungs, while Hallman, M. , et al , Pediatrics (1983) 
71:473-482 used a surfactant isolated from human amniotic 
fluid to treat a limited number of infants with some suc- 
cess. U.S. Patent 4,312,860 to Clements discloses an 
artificial surfactant which contains no protein and is 
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said to be useful in this approach although no data are 
shown. In short/ surfactant substitution has not been 
widely used clinically. 

The preferred surfactant substitute would be the 
lung surfactant complex itself. This complex is composed 
of apoprotein, two phospholipids (dipalmitoyl 
phosphocholine (DPPC) and phosphatidyl-glycerol (PG)) 
which are present in major amount, several lipid 
components present in only very minor amount, and calcium 
ions • The apoprotein contains proteins having molecular 
weights of the order of 32,000 daltons and very hydro- 
phobic proteins of the order of about 10,000 daltons 
(King, R.J. et al. Am J Physiol (1973) 224 :788-795) . The 
32,000 dalton protein is glycosylated and contains 
hydr oxypr o 1 ine . 

A major reason for the limited progress in 
surfactant replacement therapy has been the lack of avail- 
ability of the protein portion of the complex. Replace- 
ment therapies have focused on attempts to use the lipid 
components alone, and it appears that the performance of 
such treatment can be markedly improved by addition of the 
apoprotein (Hallman, M. , et al. Pediatric Clinics of North 
America (1982) (supra)). At present, however, these 
proteins are available only from normal adult human lung, 
and from amniotic fluid. Even efficient isolation 
procedures would not provide an adequate supply. Thus, it 
would be desirable to have available a method for produc- 
ing practical quantities of apoprotein for use alone or in 
conjunction with the saturated phospholipid portion of the 
complex . 

Related PCT patent application WO86/03408 
describes the recombinant production of the human ASP 
protein of about 32 kd, the retrieval of DNA sequences 
encoding various canine ASP proteins and the retrieval of 
a single representative of the human ASP protein group of 



20 



25 



30 



35 



wo 91/18015 



PCT/US9 1/03490 



-4- 



10 



15 



20 



25 



30 



35 



about 10 kd molecular weight. It is now clear that ef- 
ficient production of the "lOK" group is required for use 
in adequate therapy. 

The additional related PCT patent application 
WO87/06588, published 5 November 1987, gives further 
description of these lOK proteins and their encoding DNAs . 
Figures 1 and 2 of that application show the full-length 
cDNAs encoding precursors of canine and human SP-18- 
derived protein- The mature human protein is described to 
begin at the phenylalanine residue encoded at codon 201 of 
sthe full-length sequence. The construction of vectors 
for expression of the SP-18 precursor in both mammalian 
and bacterial cells is described in detail. Expression of 
the full-length precursor in mammalian cells yielded 43 kd 
and 25 kd precursor proteins as determined on SDS-PAGE. 
The 25 kd product is stated to be the glycosylated form of 
a 181 amino acid sequence spanning Phe-20 l-Glu-381 encoded 
in this sequence. Certain modified forms of the human 
protein to provide cleavage sites which may be helpful in 
providing more uniform production of mature forms of the 
precursor are also described. Bacterial expression of the 
SP-18 cDNA is also described. 

Figures 5 and 6 of PCT application WO87/06588 
show the DNA and deduced amino acid sequences of two cDNA 
clones encoding the precursors for the smaller molecular 
weight 5 kd-8 kd proteins, designated SP-5 . Like the SP- 
18 cDNA, these clones are disclosed to encode a precursor 
for the smaller 5 kd-8 kd proteins isolated. The putative 
N-terminus is stated to be Phe -^r Gly at codons 24 and 25 
of this sequence; it is postulated that the mature C- 
terminus of these proteins is at Gln-108 for the 8 kd 
protein and Glu-80 or Thr-65 for the 5 kd protein. 
Expression of this cDNA in mammalian and bacterial cells 
is also described . 



o 



10 



wo 91/18015 PCT/US9 1/03490 

-5- 

The disclosures of the two above-cited PCT ap- 
plications, WO86/03408 and WO87/06588, are incorporated 
herein by reference, 
^ The present application describes various SP-5- 

related peptides which are effective as lung surfactant 
proteins. These SP-5 analogs and fragments can be 
prepared by chemical synthesis or by recombinant methods 
and offer specific members of the repertoire of lung 
surfactant proteins useful in treatment of respiratory 
diseases and symptomologies - 

The parent application hereto, U.S. application 
serial no. 07/117,009, also describes the lOK group of 
proteins in some detail. The disclosure of that applica- 
tion is hereby incorporated by reference in its entirety, 
and reference may be had thereto for material not 
explicitly described or explained herein. The present 
application is based on further studies of the human SP-5 
protein, and in particular is directed to analogs of that 
protein which have now been found to have ASP activity. 
The analogs presently described and claimed, in addition 
to retaining the stability and biological activity of the 
native polypeptide, are less susceptible to aggregation 
than native 5 kd protein. 
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Disclosure of the Invention 

The invention provides specific forms of human 
SP-18- and SP-5-derived proteins. Some of these which are 
analogs of the encoded sequence display substantially 
reduced aggregation relative to the native protein, i.e., 
aggregation resulting from various types of intramolecular 
and intermolecular interaction, primarily covalent, 
disulfide bonding. These analogs are therefore much 
easier to extract and purify than the native polypeptide. 

The present SP-5-derived peptides result- from 
modifications in both the length and the amino acid 
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sequence of human SP-5, but retain chemical and physical 
stability as well as the biological activity of the native 
polypeptide . 

In other aspects of the invention, pharmaceuti- 
cal compositions for treating respiratory distress 
syndrome are provided, the compositions formulated so as 
to contain an SP-18- and/or SP-5-related peptide. The 
invention also encompasses a method of treating respira- 
tory distress syndrome by administration of an SP-18- and/ 
or SP-5-related peptide. 

Brief Description of the Drawings 

Figure 1 shows the DNA and deduced amino acid 
sequence of a cDNA encoding human SP-5-derived protein. 

Figure 2 shows an analogous cDNA variant encod- 
ing human SP-5-derived protein. 

Figure 3 is the amino acid sequence encoded by 
codons 1-74 of SP-5 DNA with the N-terminus and C-terminus 
as marked. 

Figure 4 is the human cDNA#3 encoding the SP-18 
precursor protein. 

Figure 5 is the amino acid sequence of the 
canine 5 kd protein, with the N-terminus and C-terminus as 
marked . 

Figure 6 illustrates the correlation between the 
human and canine 18 kd proteins in the lOK ASP mixture. 

Figure 7 shows the DNA and amino acid sequences 
for chloramphenicol amino transferase (CAT) and human 
atrial natriuretic protein ( hANP ) . 

Figure 8 shows the protein encoding the insert 
in PC210SP-C. 

Figure 9 is the BamHI/Hindlll insert of pC149SP- 
C which encodes a CAT-SP-5 fusion protein. 

Figures 10 through 18 are graphic representa- 
tions of the in vitro results obtained with various poly- 
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peptides in standard tests for ASP activity. Figures 10 
and 11 represent control testing done with the full human 
5 kd protein, while Figures 12 through 18 represent the 
results obtained with various analogs of the protein. 
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Modes of Carrying Out the Invention 
Definitions 

As used herein, ''alveolar surfactant protein 
(ASP)" refers to apoprotein associated with the lung 
surfactant complex and having ASP activity as defined 
hereinbelow. The ASP of all species examined appears to 
comprise one or more components of relatively high 
molecular weight (of the order of 32 kd) designated herein 
"32K ASP" and one or more quite hydrophobic components of 
relatively low molecular weight (of the order of 5-20 kd) 
designated herein "lOK ASP". (King, R. J. , et al, J Appl 
Physiol (1977) 42:483-491; Phizackerley , P.J.R,, Biochem J 
(1979) 183:731-736.) 

Further discussion of the nature of the 
surfactant proteins known to occur in mammals is found in 
WO87/06588. Briefly, as there described, the "lOK" group 
of proteins is derived from precursors encoded by two dif- 
ferent DNAs. One set of these DNAs, designated SP-18, • 
encodes a precursor for proteins which appear at ap- 
proximately 18 kd on gels, but which show molecular 
weights of 10 kd under reducing conditions. The other 
DNA, designated SP-5, encodes precursors for proteins 
which show molecular weights of 8 kd or 5 kd on gels. The 
invention herein concerns specific peptides related to 
those generated by the SP-18 and SP-5 precursor proteins. 

••ASP activity" for a protein is defined as the 
ability, when combined with lipids either alone or in 
combination with other proteins, to exhibit activity in 
the in vivo assay of Robertson, B., Lung (1980) 158 : 57-68 . 



BNSOOCID: <WO 9neoi5A1 J_> 



wo 91/18015 



PCT/US91/03490 



-8- 



10 



15 



20 



25 



30 



35 



In this assay, the sample to be assessed is administered 
through an endotracheal tube to fetal rabbits or lambs 
delivered prematurely by Caesarian section. (These 
"preemies" lack their own ASP, and are supported on a 
ventilator-) Measurements of lung compliance, blood gases 
and ventilator pressure provide indices of activity. 
Preliminary assessment of activity may also be made by an 
in vitro assay, for example that of King, R. j, , et al. Am 
J Physiol ( 1972) 22_3: 7 15-726 , or that described and il- 
lustrated in WO87/06588 of Hawgood, et al, which utilizes 
a straightforward measurement of surface tension at a air- 
water interface when the protein is mixed with a 
phospholipid vesicle preparation. All of the SP-18- and 
SP-5-derived peptides described and claimed herein show 
ASP activity. 

The "hSP-5-derived peptides" of the invention 
are intended to include polypeptides which are based on 
the amino acid sequence encoded by the human SP-5 DNA 
shown in Figures 1-2, especially those portions encoding 
the portion of the precursor amino acid sequence shown in 
Figure 3, and which have ASP activity as defined above. 
These SP-5 peptides are defined by the amino acid sequence 

X-AA2g-AA2g-AA3Q-AA3^-AA32-AA33-AA34-AA35.Leu- 

Leu-Ile-Z-Z-Z-Z-Z-Z-Leu-Ile-Z-Z-Z-Ile-Z-Gly-Ala-Leu-Leu- 
Met-Y, 



wherein 










is 


Cys 


or 


Ser, 


AAjg 


is 


Cys 


or 


Ser , 


^30 


is 


Pro 


or 


Ala, 


^31 


is 


Val 


or 


Gin, 


AA32 


is 


His 


or 


Lys, 


AA33 


is 


Leu 


or 


Ala, 
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AA^^ is Lys or Gin, 
AA^g is Arg or Gin, 
Z is either Val or lie, 

Y is OH, Gly-OH, Gly-Leu-OH, Gly-Leu-His-OH, or 
Gly-Leu-His-Y^^ wherein Y^^ is a C-terminal extension 
sequence of 1-15 amino acids corresponding to amino acids 
60-74 in Figure 3, and 

X is H or an amino acid sequence selected from 
the group consisting of H-AA2^-/ H-AA2g-AA2 y- , or X'-AA^g- 
AA27-, 

wherein: 

AA2y is Pro or Ala, 
AA2g is lie or Ser, and 

X' is H or an N-terminal extension sequence of 
1-25 amino acids corresponding to amino acids 1-25 in 
Figure 3, 

or the pharmaceutically acceptable salts or 
amides thereof, with the proviso that if X is Phe-Gly-Ile- 
Pro, Y is His-Yj^ wherein Y^ is a C-terminal extension of 
amino acids 60-66 and all Z are Val, AAjg-AA^^ cannot be - 
Cys-Cys-Pro-Val-His-Leu-Lys-Arg- . 

Preferred embodiments of Y in hSP-5-derived 
peptides within the aforementioned group are those wherein 
Y is Gly-Leu-OH or Gly-Leu-His-Y^ wherein Y^ is the C- 
terminal extension corresponding to the 15 amino acids 
numbered 60-74 in Figure 3 . Preferred embodiments of X 
are those wherein X is H, H-AA2y-, ^"^26*^27"' ^•''^"^2 6" 
AA2 ^ - or Phe-Gly-AA2 g - AAj -j - . 

And as will be discussed below, particularly 
preferred SP-5 analogs within the aforementioned group are 
those wherein both AA2g and AA^p are Ser. While not wish- 
ing to be bound by any theory, the inventors demonstrate 
herein that replacement of the two native Cys residues at 
these positions with Ser reduces intramolecular and 
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intermolecular disulfide bonding and thus, correspond- 
ingly, reduces protein aggregation • 

The "hSP-18-derived peptides" of the invention 
^ include peptides having the amino acid sequences shown in 
Figure which shows human clone #3,. spanning positions 
201 to a carboxy terminus at positions 275-281. 
Particularly preferred is the SP-18 protein spanning posi- 
tions 201-279. 

10 

Production of the Protein 

The shorter forms of the hSP-18 and hSP-5- 
derived peptides of the invention can be prepared by solid 
phase peptide synthesis or by other standard peptide 
"^^ synthetic means. These peptides are also conveniently 
produced using recombinant vectors and hosts. 

Host of the techniques which are used to 
construct vectors,, transform cells, effect expression in 
transformed cells and the like are widely practiced in the 

2 0 

art^ and most practitioners are familiar with the standard 
resource materials which describe specific conditions and 
procedures. Illustrative methods as they apply to the 
peptides of the invention are set forth with particularity 
in W086/03408 and WO87/06588. 

25 

Expression may be achieved in a variety of host 
systems including, in particular, mammalian and bacterial 
systems, as well as yeast-based systems. In addition, 
other cell systems have become available in the art, such 
as the baculovirus vectors used to express protein encod- 
'^^ ing genes in insect cells. The expression systems set 
forth below are illustrative, and it is understood by 
those in the art that a variety of expression systems can 
be used. 

As the nucleotide sequences encoding the various 

35 

hSP-5- and hSP-18-derived peptides are available through 
retrieval of cDNA or genomic DNA and/or using synthesis 
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methods, these may be expressed in this variety of 
systems. If procaryotic systems are used, an intronless 
coding sequence should be used, along with suitable 
control sequences. The cDNA clones for any of the above 
ASP proteins may be excised with suitable restriction 
enzymes and ligated into procaryotic vectors . for such 
expression, or synthetic coding sequences may be used. 
For procaryotic expression of ASP genomic DNA, the DNA 
should be modified to remove the introns , either by site- 
directed mutagenesis, or by retrieving corresponding por- 
tions of cDNA and substituting them for the intron- 
containing genomic sequences . The intronless coding DNA 
is then ligated into expression vectors for procaryotic 
expression . 

As exemplified, either genomic, cDNA, or 
synthetic (or partially synthetic) ASP-encoding sequences 
may also be used directly in an expression system capable 
of processing introns, usually a mammalian host cell 
culture. To effect such expression, the genomic or other 
sequences can be ligated downstream from a controllable 
mammalian promoter which regulates the expression of these 
sequences in compatible cells. 

In addition to recombinant production, proteins 
of the invention of sufficiently short length, such as the 
proteins related to the SP-5-encoded protein, may be 
prepared by standard protein synthesis methods. 

Protein Recovery 

The ASP protein may be produced either as a 
mature protein or a fusion protein, or may be produced 
along with a signal sequence in cells capable of process- 
ing this sequence for secretion. It is sometimes advanta- 
geous to obtain secretion of the protein, as this 
minimizes the difficulties in purification; thus it is 
preferred to express the human ASP gene which includes the 
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codons for native signal sequence in cells capable of ap- 
propriate processing. It has been shown that cultured 
mconmalian cells are able to cleave and process 
heterologous mammalian proteins containing signal 
sequences r and to secrete them into the medium (McCormick, 
et al, Mol Cell Biol (1984) 4:166), 

When secreted into the medium^ the ASP protein 
is recovered using standard protein purification 
techniques. The purification process is simplified, 
because relatively few proteins are secreted into the 
medium, and the majority of the secreted protein will, 
therefore, already be ASP. However, while the procedures 
are more laborious, it is well within the means known in 
the art to purify this protein from sonicates or lysates 
of cells in which it is produced intracellularly in fused 
or mature form. One such method is illustrated below. 

Assay for ASP Activity 

In vitro methods have been devised to assess the 
ability of ASP proteins to function by reducing surface 
tension (synonymous with increasing surface pressure) to 
generate a film on an aqueous/air interface. Studies 
using these methods have been performed on the isolated 
native lOK canine ASP (Benson, B.J., et al Prog Resp Res 
(1984) l£:83-92; Hawgood, S,, et al. Biochemistry (1985) 
Z4: 184-190). These methods are also applied to the 
individual synthetic and recombinant peptides. Since the 
function of the surfactant complex in vivo is to create a 
film at the air/aqueous interface in order to reduce 
surface tension, the ability of ASP proteins to enhance 
the formation of the film created by the spread of lipid 
or lipoprotein at such a surface in an in vitro model is 
clearly relevant to its utility. 

An in vivo model, described in detail in Section 
D.IO of WO87/06588, is also employed. 
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Administration and Use 

The purified proteins and analogs can be used 
alone and in combination in pharmaceutical compositions 
appropriate for administration for the treatment of 
respiratory distress syndrome in infants or adults. The 
compositions of the invention are also useful in treating 
related respiratory diseases such as pneumonia and 
bronchitis. The complex contains about 50% to almost 100% 
(wt/wt:) lipid and 50% to less than 1% ASP; preferably ASP 
is 5%-20% of the complex. The lipid portion is preferably 
70%-90% (wt/wt) DPPC with the remainder unsaturated 
phosphatidyl choline, phosphatidyl glycerol, 
triacylglycerols , palmitic acid, palmitoyl oleyl 
phosphoglyceride (POPG), or mixtures thereof. The complex 
is assembled by mixing a solution of ASP with a suspension 
of lipid liposomes, or by mixing the lipid protein 
solutions directly in the presence of detergent or an 
organic solvent. The detergent or solvent may then be 
removed by dialysis or evaporation. 

While it is possible to utilize the natural 
lipid component from lung lavage in constructing the 
complex, and to supplement it with appropriate amounts of 
ASP proteins, the use of synthetic lipids is clearly 
preferred. First, there is the matter of adequate supply, 
which is self-evident. Second, purity of preparation and 
freedom from contamination by foreign proteins, including 
infectious proteins, which may reside in the lungs from 
which the natural lipids are isolated, are assured only in 
the synthetic preparations. Of course, recons titution of 
an effective complex is more difficult when synthetic 
components are used. 

Preferred ASP compositions comprise either 
complexes with the isolated lOK mixture, the SP-5- or SP- 
18-encoded proteins alone, active SP-5 analogs, alone or 
in combination, a complex of the lOK and 32K mixtures, or 
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a complex of an SP-18 or SP-5-related protein and the 32K 
mixture. In the latter case, a preferred protein ratio — 
i.e., 32K:10K or 32K:SP-18 or 32K:SP-5 — is typically in 
the range of 3:1 to 2 00:1, preferably about 10:1 to 5:1. 
The 32K protein may be added directly to an aqueous 
suspension of phospholipid vesicles in an aqueous solu- 
tion. Because it is so hydrophobic, the lOK protein is 
added to the lipids in an organic solvent, such as chloro- 
form, the solvents evaporated, and the vesicles re-formed 
by hydration. 

The addition of the 32K protein to the lOK type 
for the administration of the surfactant complex appears 
to have a synergistic effect — i.e., the combination of 32K 
and lOK type proteins exerts the desired activity at 
protein concentrations lower than those required for the 
lOK protein alone. Accordingly, in a preferred method of 
the invention, the surfactant complex administered will 
contain an effective amount of the lOK mixture, or of the 
individual SP-5 or SP-18 proteins, or the hSP-5 or hSP-18- 
derived peptides of the invention in admixture with the 
32K ASP. Of course, mixtures of the individual hSP-5 or 
hSP-18-derived peptides can be used. Particularly 
preferred compositions contain the ratios of 32K:10K type 
protein as set forth above, along with a suitable amount 
of lipid component, typically in the range of 50% to 
almost 100% of the total composition. 

The compositions containing the complex are 
preferably those suitable tor endotracheal administration, 
i.e., generally as a liquid suspension, as a dry powder 
*'dust" or as an aerosol. F'.^r iirect endotracheal admini- 
stration, the complex is suspencled in a liquid with suit- 
able excipients such as, for example, water, saline, 
dextrose, or glycerol and the like. The compositions may 
also contain small amounts of nontoxic auxiliary substan- 
ces such as pH buffering agents, for example, sodium 
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acetate or phosphate. To prepare the "dusf\ the complex, 
optionally admixed as above, is lyophilized, and recovered-' 
as a dry powder • 

If to be used in aerosol administration, the 
complex is supplied in finely divided form along with an 
additional surfactant and propellent. Typical surfactants 
which may be administered are fatty acids and esters, 
however, it is preferred, in the present case, to utilize 
the other components of the surfactant complex, DPPC and 
PG. Useful propellants are typically gases at ambient 
conditions, and are condensed under pressure. Lower 
alkanes and fluorinated alkanes, such as Freon, may be 
used. The aerosol is packaged in a container equipped 
with a suitable valve so that the ingredients may be 
maintained under pressure until released. 

The surfactant complex is administered, as ap- 
propriate to the dosage form, by endotracheal tube, by 
aerosol administration, or by nebulization of the suspen- 
sion or dust into the inspired gas. Amounts of complex 
between about 0 . 1 mg and 200 mg, preferably 50-60 mg/kg 
body weight, are administered in one dose. For use in 
newly born infants, one administration is generally suf- 
ficient. For adults, sufficient reconstituted complex is 
administered to replace demonstrated levels of deficiency 
(Hallman, M., et al, J Clinical Investigation (1982) 
10:673-582) . 

In addition, one or more ASP proteins, including 
the hSP-18- and hSP-5-derived peptides described herein 
can be used as a carrier or vehicle for the delivery of 
other biologically active and important molecules to the 
lung and/or through the lung to the blood vasculature. In 
the latter case, delivery of agents important for other 
organs in the body can be effected. 

It is to be understood that while the invention 
has been described in conjunction with the preferred 
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specific embodiment thereof, that the foregoing descrip- 
tion as well as the examples which follow are intended to 
illustrate and not limit the scope of the invention. 
Other aspects, advantages and modifications within the 
scope of the invention will be apparent to those skilled 
in: the art to which the invention pertains . 



Examples 

Preparation A 
Isolation of Mammalian ASP Proteins 
Canine, hximan and bovine ASP proteins were 
obtained in purified form as described in WO86/03408 and 
WO87/06588, and DNA encoding the 32K protein for human and 
dog and DNA encoding the SP-18 protein of human and dog 
were recovered and disclosed in these applications . Two 
variants of the complete cDNA sequence encoding the SP-5 
precursor proteins for human ASP were recovered, as 
described in WO87/06588 and are reproduced as Figures 1 
and 2 herein. 

Example 1 

Identification of N- and C-Termini of the 
Isolated lOK Proteins 



The 5 kd Protein : 

The carboxyl terminus of the 5 kd protein in the 
lOK mixture is difficult to ascertain, as the protein is 
derived from a large precursor having a molecular weight 
of about 20,500 daltons . Mass spectrometric analysis of 
isolated native canine protein indicated the apparent 
carboxyl terminus to be His-59, as shown in the amino acid 
sequence for the canine protein in Figure 5- Amino acid 
sequence analysis of enzymatic cleavage fragments of 
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native SP-5 protein isolated from human lung lavage fluid 
has indicated a carboxyl terminus at Leu-58 (see 
Johansson, J. et al . , FEES Letters, 232 , No. 1 (1988), 61- 
64). Reanalysis of the native canine and native human 
forms of SP-5 by mass spectrometry has indicated that the 
molecular weights observed are consistent with species 
having a carboxyl terminus at Leu-58 in which the cysteine 
residues (at AA-28 in the canine and AA-28,29 in the human 
forms) are palmitylated via thioester bonds. Treatment of 
the native forms with reducing agent to remove the 
palmityl groups^ followed by mass spectrometric analysis 
and HPLC analysis confirmed the presence of these species. 
Accordingly, preferred analogs of the present invention 
may have a carboxyl terminus at either Leu-5 8 or His-59 as 
shown in Figures 1, 2 and 3. Using recombinant production 
techniques, the inventors herein have produced SP-5 
analogs having carboxyl terminii at both Leu-5 8 and His-5 9 
and shown them to be essentially equivalent in both the in 
vitro and in vivo assays described herein. In particular, 
analogs which have been demonstrated to be comparatively 
equivalent are the human SP-5 analog proteins having an 
amino terminus as Gly-25, Ser residues at AA-28 and AA-29, 
and a carboxyl terminus at either Leu-58 or His-59. 

The N-terminus of the human 5 kd protein was • 
determined by direct amino acid sequencing to be 
phenylalanine (at position 24 as shown in Figure 3), but 
truncated species were also found having glycine at 2 5 and 
isoleucine at 26 as alternative N-termini. 

The 18 kd Protein : 

The carboxy terminus of the 18 kd protein in the 
lOK mixture was analyzed using quantitative amino acid 
composition, amino acid sequencing of the protein begin- 
ning at the N-terminus, carboxypeptidase Y digestion (an 
enzyme which cleaves amino acids from the C-terminus of 
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proteins), and mass spectrometry. Figure 6 shows the 
amino acid sequences of the human and canine proteins* 

Sequence analysis of the canine and bovine 18 kd 
proteins^ after cleavage at methionine with cyanogen 
bromide, indicated the C-terminus of the canine protein to 
be His-279 and that of the bovine protein to be Ser-278. 
Enzymatic analysis using carboxypeptidase Y gave Leu-275 
as the C-terminus of both the canine and bovine proteins* 
Mass spectral analysis of the canine protein showed the C- 
terminus at Arg-276 with a minor sequence extending to 
His-279/ as predicted by amino acid sequencing after 
cyanogen bromide cleavage. In sum, the carboxy terminus 
is near His-279 in the canine protein, and, by analogy, 
near Met-27 9 in the human 18 kd protein. Based on the 
aforementioned results, there appear to be truncated C- 
terminal forms of the protein as well as truncated or 
staggered N-termini, depending on the particular prepara- 
tion and species. It is accordingly postulated by the 
inventors herein that there are probably a number of C- 
termini for a particular species. As seen in Figure 6, 
the putative N-teinninus for the human protein is the 
phenylalanine at position 201 as described in WO87/06588. 
The carboxy terminus approximates the methionine codon at 
position 279, as also shown in Figure 6. 

Preparation B 
Vector Construction for Mammalian Expression 
The hSP-18-derived proteins and hSP-5-derived 
proteins disclosed herein can be prepared using re- 
combinant techniques. Vectors suitable for expression of 
the various ASP-encoding sequences in mammalian cells, 
which are also capable of processing intron-containing . 
DNA, were constructed. In these vectors, expression is 
controlled by the metallothionein II (hMTII) control 
sequences, as described in WO87/05588. This published 
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application describes in detail the preparation of host 
vectors pMT, pMT-Apo^ pMT-SV{9), pMT-SV(lO) and pMT-ApolO. 
All of these vectors have insertion sites which permit a 
coding sequence to placed under control of the 
metallothionein promoter. Those vectors including "Apo" 
in their designation also contain the 3' terminal regula- 
tory signals associated with the ApoAI gene downstream of 
the insert region; those containing "9" or "10" in their 
designation also contain operable SV-40 viral enhancers. 

As described in the published application, 
pMTApolO was digested with BamHI, blunted and ligated to 
the cDNA sequences obtained from the clone #3 of 1275 bp 
encoding SP-18 precursor ^ as a blunted fragment. This was 
done by isolating an EcoRI/BamHI (partial) fragment from 
cDNA #3 avoiding the BamHI site at nucleotide 663 and 
subcloning into EcoRI/BamHI-digested pUC9. The desired 
fragment was excised with EcoRI and Hindlll, blunted with 
Klenow, and then inserted into pMTApolO. The resulting 
vector, pMT( E) :SP-18-40k, was transformed into CHO cells. 
Induction of the promoter in cultures of these transformed 
cells resulted in production of 25 kd and 4 3 kd proteins 
which are immunoprecipitated with antisera raised against 
human 18 kd ASP. When subjected to Western blot using 
antisera raised against a peptide spanning residues 336- 
353 of the precursor, the 25 kd and 43 kd proteins were 
detected. It is believed the 25 kd product represents a 
181 amino acid- sequence spanning Phe-201 :Leu-381, contain- 
ing a N-linked glycosylation site. 

As further described in WO87/06587, analogous 
vectors were constructed, including SP- 18-encoding DNA, 
using standard site-specific mutagenesis techniques to 
provide sites for in vitro cleavage of the precursor 
protein which was, apparently, produced in CHO cells from 
the full length sequence. In one such construct, the 381 
amino acid precursor was modified to replace each of the 
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Gln-199 :Gln-200 and Arg-286 :Ser-187 by AsniGly, to provide 
sites cleavable by hydroxyl amine (which cleaves between 
Asn and Gly) . Cleavage of the precursor thus produced 
with hydroxyl amine generates the putative mature form, 
with an additional gly residue at the amino terminus, and 
with the putative carboxy-terminal Arg-286 changed to an 
Asn residue. In another construct, Phe-201 and Ser-87 are 
changed to Asp residues. Cleavage with acid (between Asp 
and Pro) yields a mature form of the SP-18 protein missing 
the N-terminal Phe-201, and with an additional carboxy- 
terminal Asp residue. An additional construct allows in 
vitro processing of the precursor with a more gentle 
enzymatic procedure, employing Staph VB peptidase, which 
cleaves after Glu residues. Advantage is taken of natural 
Glu residues at Glu- 19 8 and Glu-291 by converting the Glu- 
251 to Asp. The 43 kd precursor is cleaved with Staph V8 
to yield the putative mature SP-18 protein with an ad- 
ditional Gln-Gln at the amino terminus, and Pro-Thr-Gly- 
Glu at the carboxy terminus. In an additional construct, 
Glu residues can be placed in positions 200 and/or 287. 

In a similar manner, the blunted EcoRI insert of 
the SP-5 clones of Figures 1 and 2 was placed into BamHI 
digested pMT-ApolO to obtain pMT(E) :SP-5 vectors^ and 
transformed into CHO cells. 

Example 2 

Mammalian Expression of DNA Encoding 
hSP-18- and hSP-5-Derived Peptides 

DNA sequences encoding the hSP-5- and hSP-18- 
derived proteins of the invention described herein are 
placed into BamHI-digested pMT-ApolO to obtain the ap- 
propriate expression vectors. Preferably, the DNA encod- 
ing the desired protein is ligated in operable linkage to 
a signal sequence effective in CHO cells. Transformation 
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into CHO cells and expression of the inserted sequences is 
conducted as described as follows. 

Chinese hamster ovary (CHO)-Kl cells are grown 
on medium composed of a 1:1 mixture of Coon's F12 medium 
and DME21 medium with 10% fetal calf serum. The competent 
cells are co-transformed with the vector of interest and 
pSV2:NE0 (Southern, P., et al, J Mol Appl Genet (1982) 
1^: 327-341) • pSV2:NE0 contains a functional gene confer- 
ring resistance to the neomycin analog G418- In a typical 
transformation, 0,5 ug of pSV2:NE0 and 5 ug or more of the 
expression vector DNA are applied to a 100 mm dish of 
cells. The calcium phosphate-DNA co-precipitation accord- 
ing to the protocol of Wigler, M. , et al. Cell (1979) 
1_6: 777-785, is used with the inclusion of a two minute 
"shock" with 15% glycerol in PBS after four hours of 
exposure to the DNA. 

Briefly, the cells are seeded at 1/10 
confluence, grown overnight, washed 2x with PBS,, and 
placed in 0.5 ml Hepes-buf f ered saline containing the 
CaPO^'DNA co-precipitate for 15 min and then fed with 10 
ml medium. The medium is removed by aspiration and 
replaced with 15% glycerol in PBS for 1.5-3 min. The 
shocked cells are washed and fed with culture medium. 
Until induction of MT-II-controlled expression, the medium 
contains F12/DMEM21 1:1 with 10% FBS • A day later, the 
cells are subjected to 1 mg/ml G418 to provide a pool of 
G418-resistant colonies. Successful transf ormants , also 
having a stable inheritance of the desired plasmid, are 
then plated at low density for purification of clonal 
isolates . 

The transf ormants are assayed for production of 
the desired protein, first as pools, and then as isolated 
clones in multi-well plates. The plate assay levels are 
somewhat dependent on the well size--e,g. results from 24 
well plates are not directly comparable with those from 96 
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well plates- Clones which are found by plate assay to be 
producing the protein at a satisfactory level can then be 
grown in production runs in roller bottles. Typically, 
the levels of production are higher when the scale-up is 
done. For this reason, typically 100-20 0 or more 
individual clones are assayed by various screening methods 
on plates and 5-10 of the highest producers are assayed 
under production conditions (roller bottle). 

Pools of transformed cells are grown in multi- 
well plates and then exposed to 5 x 10"^ to 1 x 10*^ zinc 
ion concentration to induce production of the desired ASP 
protein. 

Semiconf luent monolayers of individual cell 

lines growing in McCoy's 5A medium with 10% FBS are washed 

with phosphate-buffered saline (PBS) and refed with 

-4 

McCoy's containing 10% FBSr 1 x 10 zinc chloride, and 
0.25 mM sodium ascorbate. (Ascorbate may be helpful in 
mediating the hydroxylation of proline residues,) Twenty- 
four hours post induction, the cells are washed with PBS 
and refed with serum-free McCoy's containing the zinc 
chloride and ascorbate. After 12 hours, the conditioned 
media are harvested. 
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Preparation C 
Bacterial Expression Vectors 

As set forth in W087/06588, the unglycosylated 
forms of the ASP proteins can be produced in bacteria. 
For SP-18 proteins, the gene can be expressed, for 
example, to produce a 181 amino acid precursor represent- 
ing met-preceded residues 201-381 or as a hydroxylamine- 
cleavable fusion protein precursor with a 15 residue beta- 
galactosidase leader. A modified cDNA #3 encoding amino 
acids 201-381 of the cDNA, preceded by ATG is inserted 
into the Trp-controlled vector pTrp-233, described in 
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WO87/06588, between the EcoRI site and the Hindlll site to 
give pTrp-20. This construct produces a protein of M,W, 
20 kd. An analogous construct in the pBGal host vector, 
pBGal-20;. contains the same sequences of SP-18 cDNA #3 
fused to a 15 residue beta-galactosidase leader through a 
hydroxylamine-sensitive Asn-Gly doublet, and produces a 
fusion protein of MW = 22 kd. 

The pTrp-20 and pBGal-20 plasmids are used to 
transform E. coli W3110 to ampicillin resistance. Rapidly 
growing cultures of pTrp-20/W3110 or pBGal-20/W3110 in M9 
medium ( 1 x M9 salts, 0.4% glucose, 2 mg/ml thiamine, 200 
ug/ml MgSO^'THjO, 0.5% casamino acids, are treated with 
100 ug/ml lAA { 3-beta-indoleacrylate , Sigma 1-1625.) to 
induce the trp promoter. 

WO87/06588 also describes vectors encoding 
modified SP-18 protein sequences providing cleavage sites 
for expression in bacteria. In pTrp-20, codons encoding 
Arg-285 : Ser-287 were altered to encode Asn-Gly; introduc- 
ing the hydroxylamine-sensitive cleavage site, or the 
codon for Ser-287 was replaced by a codon for Asp, result- 
ing in the acid-sensitive Asp-Pro cleavage site, or the 
codon for Glu-251 was replaced with a codon for Asp, al- 
lowing cleavage with Staph V8 at Glu-291 without cleaving 
the desired protein. These constructs would be expected 
to generate SP-18 having the amino acid sequence of 287- 
381 or 291-381. Also, in both pTrp-20 and pBGal-20, the 
sequences 3' to the putative carboxy terminal Arg-286 were 
deleted and replaced by a step codon, putatively generat- 
ing peptides representing SF-IS codon positions 201-286. 
Neither construct resulted in labeled protein of proper 
size after induction, however. 

With respect to SP-5-derived proteins WO87/06588 
describes, analogous to pTrp-20, the insertion of the 
fragment of the cDNA #18 extending from Gly-25 preceded by 
ATG to Ile-197 of the Sp-5 "precursor" into EcoRI/Hindlll 
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digested pTrp-233 to give pTrp-5 and into the pBGal host 
vector to. give pBGal-5 wherein the SP-5 sequence is fused 
to a beta-galactosidase leader through a hydroxylamine- 
sensitive Asn-Gly« These vectors putatively generate 
SP-5-derived proteins spanning 25-197. Also, cleavage 
with Staph V8 of the protein expected from this construct 
at the Glu preceding Phe-24 and at Glu-65 would yield an 
SP-5-derived peptide spanning positions 24-66. 

All these constructs are transformed into E . 
coli W3110 and expressed as described above. 



Example 3 

Production of hSP-18 and hSP-5 Derived Proteins in 
Bacteria 

It may be advantageous to express the hSP-18- 
and hSP-5-derived peptides as a cleavable fusion protein 
with an upstream stabilizing sequence. U.S. Serial No. 
231,224 filed 11 August 1988, assigned to the same as- 
signee, and incorporated herein by reference, describes 
the construction of several vectors which contain a por- 
tion of the chloramphenicol acetyltransf erase 
(CAT) -encoding gene joined to DNA encoding a specified 
portion of the hSP-18 or hSP-5 peptides. Exemplified are 
vectors encoding 35 amino acids of an SP-5-derived 
peptide, i.e., hSP-5( 24-59) joined to CAT through a 6 
amino acid linker, Ser-Asp-Pro-Glu-Phe-Asn . As described 
in the above-referenced application. These vectors are 
prepared as follows. 

The vectors includinq the SP-18- and SP-5- 
derived proteins are obtained r rom a host vector 
constructed with an insert encoding human atrial 
natriuretic peptide ( hANP ) . This intermediate vector, 
pChNF109, is constructed as follows. 

Expression vector pChNFi09 encodes a 241 amino 
acid CAT-hANP hybrid protein containing an endoproteinase 
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Glu-C proteolytic cleavage site. The DNA and encoded 
amino acid sequences of CAT and hANP are shown in Figure 
7, Most of the CAT gene (amino acids 1-210) has been 
joined in-frame to the hANP ( 102-126 ) gene and cleavage 
site (26 amino acids) through a linker sequence (5 amino 
acids). This vector was constructed from plasmids 
pTrp2 33, pCAT21, and phNF75 which supplied the plasmid 
backbone and trp promoter-operator, the CAT gene, and the 
hANP( 102-126 ) gene and cleavage site, respectively. 

Plasmid pTrp233 was described in WO87/06588. 
Plasmid pCAT21 was constructed by insertion of the CAT 
gene (from transposon Tn9 , Alton and Vapnek, Nature (1979) 
282 ;864-869 ) into pTrp233 under the control of the trp 
promoter-operator. Plasmid pAL13ATCAT (a plasmid contain- 
ing Tn9, disclosed in co-pending U.S. Serial No. 095,742, 
filed 11 September 1987, and incorporated herein by refer- 
ence) was digested with Ndel and Hindlll and the ap- 
proximately 750 bp Ndel-Hindlll fragment containing the 
CAT gene (with the initiating Met residue encoded at the 
Ndel site) was purified using agarose gel electrophoresis. 
The CAT gene was ligated with Ndel/Hindlll-digested 
pTrp233 using T4 DNA ligase, and the resulting plasmid 
pCAT21 was isolated from E. coli MC1061. 

Plasmid phNF75 was constructed by insertion of a 
synthetic hANP gene preceded by a proteolytic cleavage 
site into plasmid pBgal (Shine et al. Nature (1980) 
285 :456) . Eight oligodeoxyribonucieotides were assembled 
into a synthetic hANP ( 102-126 ) gene preceded by an 
endoproteinase Glu-C cleavage site. The synthetic gene 
was ligated into BamHI-digested pTrp233. A plasmid with 
the insert in the orientation which gives adjacent 
Hindlll, BamHI and EcoRI sites at the 3' end of the hANP 
gene, phNF73, was identified by the size of the fragments 
generated by digestion wit:h Hindlll and PvuII . Plasmid 
phNF73 was digested with EcoRI, the hANP gene purified 
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using polyacrylamide gel electrophoresis / and the gene 
ligated into EcoRI-digested. pBgal to obtain phNF7 5. 

Expression vector pChNF109 was constructed by- 
insertion of DNA fragments containing CAT, hANP and the 
proteolytic cleavage site, and a linker sequence into 
plasmid pTrp233 . Plasmid phNF75 was digested with EcoRI 
and Hindlll, the approximately 8 0 bp EcoRI-Hindlll frag- 
ment containing hANP was purified by polyacrylamide gel 
electrophoresis, and ligated into EcoRI/Hindlll-digested 
pTrp233 to obtain phNF87 . pCAT21 was digested with Seal, 
and BamHI synthetic linkers ( 5 ' -CGGATCCG-3 ' ) were attached 
to the blunt termini. The ligation was digested with 
BamHI and the approximately 7 40 bp BamHI fragment was 
purified by agarose gel electrophoresis- The BamHI cas- 
sette and BamHI -digested plasmid phNF87 were ligated to 
obtain pChNF109 having the CAT gene fused in-frame to the 
endoproteinase Glu-C cleavage site followed by the hANP 
gene . 

By replacing the hANP encoding sequences in 
pChNF109 with SP-5 and SP-18 sequences, human SP-5- and 
SP-18-derived peptides are expressed as fusions with por- 
tions of bacterial CAT- The surfactant peptides are 
joined to the carboxy terminus of the CAT sequences 
through a hydroxylamine-sensitive asparagine-glycine link- 
age. The CAT-surf actant fusions are expressed from the 
tryptophan promoter of the bacterial vector pTrp233. 

Expression Vector pC210SP-B > 

SP-18 expression vector pC210SP-B encodes a fu- 
sion protein of 293 residues in which 210 amino acids of 
CAT are joined to the 75 amino acids of SP-18 through a 
linker of 7 amino acids containing the hydroxylamine- 
sensitive cleavage site. Cleavage of the fusion with 
hydroxylamine releases a 77 amino acid SP-18 product 



wo 91/18015 



PCT/US91/03490 



10 



15 



20 



25 



30 



35 



-27- 

containing 76 residues of SP-18;. plus an amino-terminal 
glycine residue. 

To construct pC210SP-B, the short EcoRI-Hindlll 
segment containing hANP sequences was removed from 
pChNF109, and replaced by a portion of human SP-18 cDNA #3 
(Figure 3) extending from the PstI site at nucleotide (nt) 
643 to the SphI site at nt 804. The EcoRI site was joined 
at the PstI site through two complementary 
oligonucleotides encoding the hydroxylamine sensitive 
cleavage site as well as the amino-terminal residues of 
mature SP-18 (oligo #2307: 5'-AAT TCA ACG GTT TCC CCA TTC 
CTC TCC CCT ATT GCT GGC TCT GCA-3' and oligo #2308: 5 ' -GAC 
CCA GCA ATA GGG GAG AGG AAT GGG GAA ACC GTT G-3 ' ) . The 
SphI site was joined to the Hindlll site of pTrp233 
through a second set of complementary nucleotides encoding 
the carboxy-terminal residues of the SP-18 peptide (oligo 
#3313: 5'-AGC TTA CCG GAG GAC GAG GCG GCA GAC CAG CTG GGG 
CAG CAT G-3' and oligo #3314: 5'-CTG CCC CAG CTG GTC TGC 
CGC CTC GTC CTC CGG TA-3'), 

The expression plasmid was used to transform E . 
coli stain W3110 to ampicillin resistance. Rapidly grow- 
ing cultures of pC210SP-B/W3110 in M9 medium were made 25 
ug/ml lAA (3-beta indoleacrylate, Sigma 1-1625) to induce 
the trp promoter. By 1 hr after induction, refractile 
cytoplasmic inclusion bodies were seen by phase contrast 
microscopy inside the still-growing cells. 5 hr after 
induction, the equivalent of 1 O.D.^^q of cells were 
pelleted by centrif ugation , then boiled for 5 min in SDS 
sample buffer for electrophoresis in a 12% SDS- 
polyacrylamide gel followed by staining with Coomassie 
Blue- The predicted molecular weight of the CAT: SP-18 
fusion protein is 45,000 daltons. The hybrid CAT: SP-18 
protein was estimated to comprise 15-20% of the total cell 
protein in the induced cultures. 
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PC210SP-C . 

An amino acid sequence of a 251 residue fusion 
protein is encoded in plasmid pC210SP-C. The 210 amino 
acids of CAT are joined to 35 amino acids of mature SP-5 
through a linker of 6 amino acids. The SP-5 comprises 14% 
of the total fusion. 

In Figure 8 is shown the nucleotide sequence of 
the insert in pC210SP-C, in which the EcoRI-Hindlll frag- 
ment of pC210SP-B containing SP-18 sequences has been 
replaced by a segment of human SP-5 cDNA #18 extending 
from the ApaLI site at nucleotide 123 to the Avail site at 
nucleotide 161. The EcoRI site of the CAT vector was 
joined to the SP-5 ApaLI site through two complementary 
oligonucleotides encoding the hydroxylamine sensitive 
cleavage site as well as the amino-terminal residues of 
mature SP-5 (oligo #2462: 5'-AAT TCA ACG GCA TTC CCT GCT 
GCC CAG-3' and oligo #2463: 5 ' -TGC ACT GGG CAG CAG GGA ATG 
CCG TTG-3')- The Avail site of SP-5 was joined to the 
Hindlll site of pC210SP-B through a second set of com- 
plementary nucleotides encoding the carboxy-terminal 
residues of mature SP-5 and a stop codon (oligo #2871: 5'- 
AGC TTA GTG GAG ACC CAT GAG CAG GGC TCC CAC AAT CAC CAC 
GAC GAT GAG-3' and oligo #2872: 5 ' -GTC CTC ATC GTC GTG GTG 
ATT GTG GGA GCC CTG CTC ATG GGT CTC CAC TA-3 ' ) • 

pC179SP-C > 

The amino acid sequence of the 217 residue fu- 
sion protein encoded by pC179SP-C is a slight modification 
of the sequence shown in Figure 8. In pC179SP-C, the 179 
amino acids of CAT are joined to 35 amino acids of mature 
SP-5 through a linker of 3 amino acids (Glu, Phe, Asn) . 
SP-5 comprises 16% of the total fusion. 

To construct pC179SP-C, a portion of the CAT 
sequence was removed from pC210SP-C. Starting with 
pC210SP-C, a DNA fragment extending from the Ncol site at 
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nt 603 (Figure 8) to the EcoRI site at nt 728 was removed, 
and the Ncol and EcoRI cohesive ends were rejoined with 
two complementary oligonucleotides (oligo #3083: 5 ' -CAT 
GGG CAA ATA TTA TAG GCA AG-3' and oligo #3084 : 5 ' -AAT TCT 
TGC GTA TAA TAT TTG CC-3 ' ) • In effect, 31 residues of 
CAT, and 3 residues of the linker polypeptide are missing 
in the new fusion protein encoded by vector pC17 9SP-C. 

pC149SP-C > 

The amino acid sequence of the 187 residue fu- 
sion protein encoded by pC149SP-C is a slight modification 
of the sequence shown in Figure 8. In plasmid pC14 9SP-C, 
the 149 amino acids of CAT are joined to 35 amino acids of 
SP-5 through a linker of 3 amino acids (Glu, Phe , Asn) . 
SP-5 comprises 18.7% of the total fusion. 

To construct pC14 9SP-C, a portion of the CAT 
segment of pC210SP-C extending from the Ddel site at nt 
523 (Figure 8) to the EcoRI site at nt 728 was removed and 
replaced by a set of two complementary oligonucleotides 
(oligo #3082: 5'-TCA GCC AAT CCC G-3' oligo #3081: 5 ' -AAT 
TCG GGA TTG GC-3 ' ) . The resulting sequence is shown in 
Figure 9 . 

pClOSSP-C . 

The amino acid sequence of the 144 residue fu- 
sion protein encoded by pC106SP-C is a slight modification 
of the sequence .shown in Figure 8 . . In plasmid pC106SP-C, 
the 106 amino acids of CAT are joined to 35 amino acids of 
SP-5 through a linker of 3 amino acids (Glu, Phe, Asn). 
SP-5 comprises 24% of the total fusion. 

pC106SP-C was constructed by replacing the EcoRI 
fragment of pC210SP-C { nt 302 to nt 728, Figure 8) with 
two sets of complementary oligos which were annealed, then 
ligated together through a region of homology (oligo 
#307 9: 5 ' -AAT TCC GTA TGG CAA TGA AAG ACG GTG AGC TGG TGA 
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TAT GGG ATA GTG TTC ACC CTT GT-3 ' was annealed with oligo 
#3085: 5'-ACA CTA TCC CAT ATC ACC AGC TCA CCG TCT TTC ATT 
GCC ATA CGG-3'; oligo #3080: 5 ' -TAG ACC GTT TTC CAT GAG 
CAA ACT GAA ACG TTT TCA TCG CTC TGG G-3' was annealed with 
oligo #3078: 5 ' -AAT TCC CAG AGC GAT GAA AAC GTT TCA GTT 
TGC TCA TGG AAA ACG GTG TAA CAA GGG TGA-3' ) • 

Each SP-5 expression vector was used to 
transform E. coli strain W3110 to ampicillin resistance . 
Rapidly growing cultures of expression strains were 
induced as described above. By 1 hr after induction, 
refractile cytoplasmic inclusion bodies were seen by phase 
contrast microscopy inside the still-growing cells. 5 hr 
after induction, the equivalent of 1 O^D.^^q of cells were 
pelleted by centrif ugation, then boiled for 5 min in SDS 
sample buffer for electrophoresis in a 12% SDS- 
polyacrylamide gel followed by staining with Coomassie 
Blue. Proteins of the correct KW were obtained from these 
vectors. The hybrid CAT: SP-5 protein produced by each 
vector is estimated to comprise 15-20% of the total cell 
protein in the induced cultures. 

Modification of SP-5 DNA 
To obtain modified sequences encoding hSP-5 
analogs, site-directed mutagenesis can be used. For 
example, starting with pC149SP-C, the BamHI/Hindlll frag- 
ment shown in Figure 9 is excised and cloned into mp8 . 
The insert is then subjected to site-directed mutagenesis 
using the primer 5 ' -GTG-CAC-TGG-GGA-GGA-GGG-AAT-GCC-3 ' as 
shown in the figure. This results in the codons for 
cysteine at positions 2 8 and 2 9 of the mature protein be- 
ing converted to codons for serine in these positions. 
The mutagenized BamHI /Hindi II fragment is then isolated 
and then ligated back into the expression vector pTrp233 - 

The constructs, for example, C149SP-C or the 
corresponding mutagenized vector are then transformed into 
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coli and the transformed cells are cultured using 
standard techniques. The trp promoter is induced by 
treatment of the culture with lAA, and expression of the 
5 gene encoding the desired SP-5-derived peptide is 
obtained . 

Purification of hS P-5-Deri ^ed Pen^id^^ 
The bacterial cells are then lysed by passage 
through a homogenizer. The insoluble inclusion bodies 
^0 released by this treatment are recovered by centrif ugation 
at 5000 rpm for 30 minutes or by filtration through 0 1 
mxcron Millipore Durapore membranes. The resulting inclu- 
sion bodies are washed 3x with either 1% Triton X-100 or 
m 1.0 M guanidine hydrochloride, 10 mM EDTA, 20mM Tris- 
^ 15 HCl, pH 8.0 and 100 mM collected by centrif ugation or 
filtration as described. The inclusion bodies are 
solubilized in 20 mM Tris-HCl, pH 8.0, 6M guanidine HCl 
50 mM DTT, at a concentration of 15-25 ng/ml. 

After removal of insoluble material by 
2 0 centrif ugation the fusion protein containing the SP-5- 
derived peptide is cleaved by addition of an equal volume 
of hydroxylamine (2M) in 6M guanidine hydrochloride, 50mM 
DTT containing 0.2 M K2CO3 . The cleavage is allowed to 
proceed for 48 hours. The solution is diluted to 1 2 M 
25 guanidine HCl (5 fold) with 10 mM Tris, pH 8.0, 20 mM DTT 
Thxs causes the proteins in the cleavage reaction mix to * 
Q IToT^^''^^^'' ""^^^ P^^^iPit^te is collected by centrifuga- 

The SP-5-derived peptide is then extracted from 
3Qthe majority of the remaining protein with a 

chloroform:methanol (1:1, v:v, solution containing 100 mM 
DTT. Enough of this solution is added to the precipitate 
so that the SP-5-derived peptide is 1 mg/ml; and the 
material is extracted for six hours at room temperature 
35The extract is then centrifuged to remove insoluble 
material . 



BNSOOCID: <WO 91 ia015A1J_> 



wo 91/18015 



PCr/US91/03490 



-32- 

The supernatant from this centrif ugation is then 
mixed with sulfo-propyl cellulose (0.04 ml of sulfo-propyl 
cellulose) per mg of SP-5 peptide in the extract. The 
extract is acidified with HCl to bring the concentration 
to 5mM. After allowing the SP-5 to bind overnight, the 
sulfo-propyl cellulose is extensively washed with a buffer 
containing 19 parts chloroform, 19 parts methanol, and 2 
parts 0.1 N HCl (wash buffer). Additional washes are 
performed with a solution of wash buffer adjusted to 50 mM 
DTT with solid DTT and made 20 mM sodium acetate, pH 4 , by 
addition of a stock solution of 2M Na acetate. The SP-5 
peptide is then eluted with wash buffer containing 50 mM 
DTT and adjusted to 50 mM Na-acetate, pH 6 , by adding a 
stock solution of 2M Na-acetate, pH 6. The SP-cellulose 
is removed by filtration. The final step of purification 
is gel permeation chromatography over Sephadex LM-60 using 
the wash buffer described above as the eluant. 

Example 4 
Synthetic Peptides 
Various synthetic peptides based on the human 
SP-5-encoded mixture have been synthesized using standard 
techniques. Referring to Figure 3, the following peptides 
have been synthesized using solid phase peptide synthesis: 

(1) hSP-5 ( 24-74 ) , i.e., beginning at Phe-24 and 
ending at Gly-74; 

(2) hSP-5 ( 34-74 ) , i.e., beginning at Lys-34 and 
ending at Gly-74; and 

(3) hSP-5 ( 24-61 ) , i.e., beginning at Phe-24 and 
ending at Ser-61. 



35 
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hSP-5f 24-74 ) : 

Phe-Gly-Ile-Pro-Cys-Cys-Pro-Val-His-Leu-Lys-Arg. 
Leu-Leu-Ile-Val-Val-Val-Val-Val-Leu-Ile-Val-Val-Val-lle- 
5 ^^^-^^y-A^^-^eu-Leu-Met-Gly-Leu-His-Met-Ser-Gln-Lys-His- 
Thr-Glu-Met-Val-Leu-Glu-Met-Ser-Ile-Gly. 

hSP-5f34-74) ; 

Lys-Arg-Leu-Leu-Ile-Val-Val-Val-Val-Val-Leu-ile- 
10 ^^1-Val-Val-Ile-Val-Gly-Ala-Leu-Leu-Met-Gly-Leu-His-Met- 
Ser-Gln-Lys-His-Thr-Glu-Met-Val-Leu-Glu-Met-Ser-Ile-Gly. 

hSP-5f 24-61) ! 

,^ ^ P^^-G^y-Il®-P^°-Cys-Cys-Pro-Val-His-Leu-Lys-Arg- 

U 15 ^^"-^^"-Ile-Val-Val-Val-Val-Val-Leu-Ile-Val-Val-Val-ile- 
Val-Gly-Ala-Leu-Leu-Met-Gly-Leu-His-Met-Ser. 

synthetic peptides were prepared by standard 
t-BOC solxd-phase peptide synthesis methodology using an 
20 Applied Biosystems 430A peptide synthesizer. The protect- 
ing groups used were: Cys - 4-methyl benzyl; His - Hoc- 
Lys - 2-chloroCBZ, Arg - Tosyl . The resin was built up' 
on Boc-Ser(OBzl)-0-PAM resin (0.5 nunol, 0.72 meq/g load- 
mg). All residues were single coupled with HOST estER 
25 AND His residues which were double coupled using Boc- 
His(Boc)-OH as its symmetrical anhydride. Resin was " 
removed prior to addition of Arg at residue 12. This 
Q resin can be directly sequenced in a gas phase sequencer 

at this point, since most of the side chains are not 
30Protected, and run as authentic c-TH-amino acids. After 
confirming that the sequence wa. reasonably homogeneous, 
the synthesis was completed. The resin was cleaved and 
all protecting groups removed using standard HF cleavage 
conditions. To 1 g of peptide-resin was added 1 . 5 ml 
35anisole, 0.25 ml methyl ethyl sulfide and 15 ml of 
dxstilled HF, and cleavage was conducted in a standard 
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Kel-F HF cleavage apparatus. Cleavage was effected at - 
103C for 30 min followed by an additional 30 min at 03C. 
Hf was immediately and rapidly removed in order to 
5 minimize aggregation. However, aggregation still occurs 
to some degree since only very small quantities of the 
selected peptide can be resolubilized from the cleavage 
mixture. After removal of HP, the resin-peptide mixture 
was alternately washed with ether and chloroform and then 
10 dried. The peptide cannot be solubilized using standard 
aqueous extraction but must be separated from resin by 
solubilizing in MeOH/chloroform/HCl . 

After solubilization in 75% trif luoroacetic 
acid, each of the peptides was purified. a preferred 
15 method of purification is gel filtration using LH-60 

columns in a solution of chloroform and methanol (1:1 v/v) 
containing 0.5% HCl . Each of the synthetic peptides was 
tested for in vitro and in vivo activity as in Example 5 . 

20 Example 5 

Activity of Synthetic Peptides 
In vitro and in vivo activity was evaluated 
using the procedure described in PCT Publication W087/ 
06588, Cited sugra, at Section D.9 and D.IO, respectively 

25 Peptide 34-74 was ineffective in phospholipid 

fxlm spreading in vitro, and, as might be expected from 
this result, was also ineffective in premature rabbit 
lungs. The N-terminal amino acids would thus appear to be 
required for maximal activity. 

30 Peptide 24-74, a C-t<.rminal extended peptide 

was quite effective in both decreasing the surface tension 
of an air-water interface in vitro and in effecting 
reasonable lung function in animals. In Table 1 p is 
a measure of how effective the surfactant f ormulati jn^s 

35 xn lowering surface tension in the lungs. This decreased 
tension is manifested by a decreased pressure of inspired 
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oxygen. Peptide 24-74 was quite effective compared to the 
saline control solution, and nearly as effective as the 
rabbit surfactant positive control. It should be noted 
that the native 5 kd protein is as effective as the 
surfactant control. 



10 



10 min. 



Table 1 

P . 
ms 



2 0 min. 



3 0 min. 



N 



15 



20 



25 



30 



Rabbit 

Surfactant 25 
SP 5K 

10:1 27 
24-74 

10:1 28 

Sodium 
Chloride 



23 

22.5 
24 



20 
20 
22 



ms 



35 34 33 

values at 10,20 and 30 minutes refer to the 



5 
3 



35 



inspiratory pressures (cm H2O) required to maintain tidal 
volumes in the lung of 6-7 mis /kg body weight. (The lower 
the pressure on the ventilator, the better.) 

Peptide 24-61 was found to be as effectiv^e in 
vivo as native surfactant. In fact, in certain animal 

was lower in the animals treated 

In all cases, 

the phospholipid mixture was DPPC:egg PG (7:3, w/w) and 
the ratio of PL to protein was 10:1. It is preferable 
that the peptide be administered in conjunction with ad- 
ditional lipids as described infra , and, accordingly, in 
the studies summarized in Table 2, 10 wt . % palmitic acid 
was incorporated into the formulations. Thus, the formula 



experiments, the P. 
^ ' xns 

with 24-61 than in the surfactant control. 
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was DPPCiPG: Peptide: fatty Acid in a weight ratio of about 
10:1:1:1. 

Table 2 

Five different experiments were carried out in vivo. The 
numbers refer to pressures, ^^^13/ 30 minutes. 

Study # Surfactant NaCl 24-61 

119 22 0* 22 

120 16.5 26 15 

121 19 33 16 

122 16.5 33 15 

123 0* 32 15.5 

Ave 18.5 31 16.7 

*0 refers to a pneumothorax before the end of the 
experiment . 

In all cases, 24-61 synthetic peptide was mixed with 
PL: Palmitic acid:syn pep (10:1:1) by weight. The 
phospholipid (PL) is DPPC:PG (7:3) by weight. 

Example 6 
Additional hSP-5 Peptides 
The following SP-5-derived peptides of the 
invention have been synthesized using the CAT fusion 
method of Example 3 or by solid phase synthesis and tested 
for ASP activity: 

(1) hSP-5 ( 28-59 ) , i.e., beginning at Cys-28 and 
ending at His-59; 

(2) hSP-5(30-59 ) , i.e., beginning at Pro-30 and 
ending at His-59; 

(3) D5K#1: fS^^ S^^-hSP-5(24-59) ) , i.e., 
equivalent to hSP-C ( 24-59 ) , but having serines substituted 
for the cysteines at positions 28 and 29; 

(4) D5K#2: (A^"^ S^^ S^^ A^°-hSP-5 ( 24 -59 ) ) , 
i.e., beginning at Phe-24 and ending at His-59, with 
serines substituted for the cysteines at positions 28 and 
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29 and alanine substituted for proline at positions 27 and 
30; and 

(5) D5K#3: (S^^ g^^ S^^ Q^^ k^^, a^^, 

^ hSP-5 ( 25-59 ) , beginning with Gly-25 and ending at His-59, 
with amino acids 26 through 33 replaced as indicated. 

After solubilization, each of the peptides was 
purified as described above. Each of the synthetic 
peptides was tested for in vitro and in vivo activity, 
using the above-described procedures. 

Figure 10 represents a control, and is a graphic 
representation of surface pressure versus time obtained 
with the full length human 5 kd protein formulated with 
DPPC:PG and palmitate (ratio of DPPC :PG:palmitate : protein 
approximately 7:3:1:0.5). Figure 11 is similar, 
representing the results obtained with the full length 
human 5 kd protein formulated with the same components 
(7:3:1:1) and tested at a pH of 7,5 and a temperature of 
37^C. 

The synthetic peptide hSP-5(28-59 ) , formulated 
with DPPC:PG and palmitate as in the control experiment 
represented in Figure 12, showed an initial rate of 
spreading identical to that observed with the natural 
peptide . 

The SP-5-derived synthetic peptide, hSP-5 ( 30- 
59j_/ has reduced activity (Figure 13). That this loss of 
activity is not specifically due to the absence of the 
cysteine residues at positions 28 and 29 is suggested by 
the observation that the D5K#1 synthetic peptide (S^^,S^^- 
hSP-5 ( 24-59 ) )., in which these cysteine residues were 
replaced with serines, showed full activity (Figure 15). 
This result implies that the reduction in polypeptide 
length, rather than loss of specific residues, results in 
the loss of ASP activity . 

The D5K#1 peptide ( ^ , ^-hSP-5 ( 24-59 ) ) was 
produced recombinantly and purified as described in 



20 



25 



30 
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2 8 2 9 

Example 2. The recombinant SP-5 peptide (S ,S -hSP- 
5(24-59)) was tested using the standard protocol for in 
vitro assay with the following modification: the 
phospholipid formulation was 7 parts DPPC:3 parts POPG 
where POPG is (palmitoyl-oleoyl PG) . The final formula- 
tion was 2 0 parts phospholipid mixture:! part protein by 
weight. The Ser-Ser analog was highly active in vitro and 
at least as effective as purified rabbit surfactant in 
vivo . 

The D5K#1 synthetic peptide { S^^ , S^^-hSP-5 ( 24- 
59)) peptide analog was also active in vivo when tested 
according to the procedure set forth in WO87/06588r as 
shown below: ^r'N 

15 

D5K#1 ^ S ^ ^ -hSP-5f 24-59) (cys — >ser change) 



10 



20 



25 



30 



35 



P,. ^ at 30 minutes: 
ms 

n = 4 cys — > ser 18.0+0-7 (SD) cm of 

rabbit 18.5+1.2 cm of 

surfactant 
NaCl 34.6+1.2 cm of H^O 

^TOT minutes: 

cys — > ser peptide 0.486+0.3 

rabbit surfactant 0.494+0-7 

NaCl 0.224+0.2 

28 29 — ' ? 

Recombinant S -hSP-5 ( 24-59 ) has also been tested 

in vivo and is at least as effective as purified rabbit 

surfactant . 

2 8 ° 

The peptide S , S"" -hSP-5 ( 25-59 ) was produced 

recombinantly in bacteria as described in Example 3 and 

purified. Using standard techniques of site-directed 

mutagenesis, the vector for this peptide was modified to 

eliminate the codon for His (59). The vector was used to 

2 8 2 9 

recombinantly produce the peptide S , S -hSP-5 ( 24-58 ) . 
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2 8 2 9 

The recombinant peptide S ,S-hSP-5( 24-59) and hSP-5(24- 
59), i.e., the peptide without the Ser-Ser mutation, were 
tested side-by-side using the standard protocol for the in 
vitro assay with the following modification: the 
phospholipid formulation was 7 parts DPPC:3 parts POPG. 
The final formulation was 20 parts phospholipid mixture ;1 
part protein by weight. The superimposed plots of surface 
pressure, shown in Figure 17, indicate that the Ser-Ser 
analog was at least as effective in vitro as the 
corresponding recombinant protein having the native 
sequence . 

The S^^,S^^-hSP-5( 24-59) peptide was also tested 
in the in vitro assay with the same modifications. The 
plot of surface pressure, presented in Figure 18, 
indicates that removal of His {59) does not change the 
efficacy of the protein. 

The recombinantly produced peptides 
S^^,S^^-hSP-5{24-59) , S^^ , S^^-hSP-5 ( 25-58 ) and hSP( 25-59) 
were tested in vivo and compared with isolated rabbit 
surfactant. Results are shown below: 



P. „ at 30 minutes: 
ins 

rabbit surfactant 17.5 cm of HjO 

S^^,S^^-hSP-5{24-59) 16.0 cm of K^O 

S^^,S^^-hSP-5(25-58) 16.0 cm of H^O 

hSP(25-59) 16.0 cm of 



Further suggestion that specific amino terminal 
sequences are not a requirement for full activity is 
provided by the analysis of D5K#2 and D5K#3 . These 
peptides, in which more extensive substitutions have been 
made in the amino terminal region^ also retain full activ- 
ity in vitro (Figures 15 and 16). 

Other peptides which the inventors herein 
believe may be useful in the present method are the 31-61, 
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30-61, 28-61 and 26-61 peptides of the human SP-5-encoded 
protein shovm in Figure 3 . 
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Claims 

1. A purified polypeptide having ASP activity 
^ and defined by the sequence 

X— AA2 g """^^^2 9 """^^^^ 0 ""^^^3 1 ''^^*^3 2 """^^^3 3 ~"^^^3 4 """^^^S 5 ~ 
Leu-Ile-Z-Z-Z-Z-Z-Z-Leu-Ile-Z-Z-Z-Ile-Z-Gly-Ala-Leu-Leu- 
Met-Y^ 
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wherein: 

^28 Ser, 
AAjg is Cys or Ser, 
AA^Q is Pro or Ala, 
AA-jj is Val or Gin, 
AA22 is His or Lys, 
AA22 is Leu or Ala, 
AA^^ is Lys or Gin, 
AA^g is Arg or Gin, 
Z is either Val or lie, 

Y is OH, Gly-OH, Gly-Leu-OH, Gly-Leu-His-OH, or 
Gly-Leu-His-Yj^ , wherein Y^^ is a C-terminal extension 
sequence of 1-15 amino acids corresponding to amino acids 
60-74 in Figure 3, and 

X is H or an amino acid sequence selected from 
the group consisting of H-AAjy-, H- AA2 g -AAj ^ - , or X'-AA^g- 
AAj^-f 

wherein: 

AA27 Ala, 
AAjg is lie or Ser, and 

X' is H or an N-terminal extension sequence of 
1-25 amino acids corresponding to amino acids 1-25 in 
Figure 3, 

or the pharmaceutically acceptable salts or 
amides thereof, with the proviso that if X is Phe-Gly-Ile- 
Pro, Y is His-Yj^ wherein Y^^ is a C-terminal extension of 
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amino acids 60-66 and all Z are Val, AA2g-AA25 cannot be - 
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Cys -Cys -Pro- Val -His -Leu-Lys -Ar g- . 



2- The polypeptide of claim 1, wherein Y is 
Gly-Leu-OH or Gly-Leu-His-Y^^ • 

3. The polypeptide of claim 1^ wherein X is 
^21'' ^26 "^27"' *^-'-y*^26*^27" Phe-Gly-AA2 g-AAj^- . 

4. The polypeptide of claim 2, wherein X is H, 
^27"' ^2 6 "^27"' ^"'-y"^26"'^2 7" ^^^-^^Y'^2S'-^2l' ' 

5. The polypeptide of claim 4, wherein both 
AAjg and Ser. 

6. The polypeptide of claim 5, wherein both 
^21 ^30 Ala. 

7. The polypeptide of claim 6^ wherein AA^^ is 
Gin, ^^32 f ^33 Ala. 

8. The polypeptide of claim 1, having the amino 
acid sequence selected from 

Phe-Gly-Ile-Pro-Ser-Ser-Pro-Val-His-Leu-Lys-Arg- 
Leu-Leu-Ile-Val-Val-Val-Val-Val-Val-Leu-Ile-Val- 
Val-Val-Ile-Val-Gly-Ala-Leu-Leu-Met-Gly-Leu-His , 

Phe-Gly-Ile-Ala-Ser-Ser-Ala-Val-His-Leu-Lys-Arg- 
Leu-Leu- Ile-Val-Val-Val-Val-Val -Val -Leu- I le-Val- 
Val-Val-Ile-Val-Gly-Ala-Leu-Leu-Met-Gly-Leu-His , 

Gly-Ser-Ala-Ser-Ser-Ala-Gln-Lys-Ala-Lys-Arg-Leu- 
Leu-Ile-Val-Val-Val-Val-Val-Val-Leu-Ile-Val-Val- 
Val-Ile-Val-Gly-Ala-Leu-Leu-Met-Gly-Leu-His, 
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Phe-Gly-Ile-Pro-Cys-Cys-Pro-Val-His-Leu-Lys-Arg- 
Leu-Leu- Ile-Val-Val-Val-Val -Val-Leu-Ile-Val-Val- 
Val-Ile-Val-Gly-Ala-Leu-Leu-Met-Gly-Leu-His-Met- 
Ser-Gln-Lys-His-Thr-Glu-Met-Val-Leu-Glu-Met-Ser- 
Ile-Gly, and 

Phe-Gly-Ile-Pro-Cys-Cys-Pro-Val-His-Leu-Lys-Arg- 
Leu-Leu-Ile-Val-Val-Val-Val-Val-Val-Leu-Ile-Val- 
Val-Val-Ile-Val-Gly-Ala-Leu-Leu-Met-Gly-Leu-His- 
Met-Ser. 

9. The polypeptide of claim 1, having the amino 
acid sequence 

Gly-Ile-Pro-Ser-Ser-Pro-Val-His-Leu-Lys-Arg-Leu- 
Leu-Ile-Val-Val-Val-Val-Val-Val-Leu-Ile-Val-Val-Val-Ile- 
Val-Gly-Ala-Leu-Leu-Met-Gly-Leu-His . 

10. The polypeptide of claim 1, having the 
amino acid sequence 

Gly-Ile-Pro-Ser-Ser-Pro-Val-His-Leu-Lys-Arg-Leu- 
Leu-Ile-Val-Val-Val-Val-Val-Val-Leu-Ile-Val-Val-Val-Ile- 
Val-Gly-Ala-Leu-Leu-Met-Gly-Leu . 



30 



11. A pharmaceutical composition effective in 
^ treating respiratory distress syndrome ( RDS ) in mammals, 

which composition comprises the polypeptide of claim 1 in 
admixture with a pharmaceutical].-/ acceptable excipient. 

12- A method for treating respiratory distress 
syndrome (RDS) in mammals, which comprises to a mammal in 
need of such treatment an effective amount of the 
polypeptide of claim 1. 
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13. A recombinant DNA in isolated form consist- 
ing essentially of DNA encoding the peptide of claim 1. 

14. A DNA comprising an expression system 
capable,- when transformed into a recombinant host, of 
expressing the DNA of claim 13. 

15. A recombinant host transformed with the 
expression system of claim 14. 

16. A method to produce a peptide of the 

formula 

X-AA2 g -AAj g -AA^ Q -AA^ ^ - AA^ ^ - AA^ 3 -AA^ ^ -AA^ 5 -Leu- 
Leu- lie- Z-Z-Z-Z -Z -2 -Leu- Ile-Z-Z-Z -Ile-Z -Gly-Ala-Leu-Leu- 
Met-Y, 



wherein 


* 






AAjQ 


is 


Cys 


or 


Ser, 


^29 


is 


Cys 


or 


Ser, 


^30 


is 


Pro 


or 


Ala, 


^31 


is 


Val 


or 


Gin, 


AA32 


is 


His 


or 


Lys, 


AA33 


is 


Leu 


or 


Ala, 


AA34 


is 


Lys 


or 


Gin, 


AA35 


is 


Arg 


or 


Gin, 


Z is 


either 


Val or lie, 


Y is 


OH 


Gly- 


-OH, 


Gly-Leu-OH, Gly-Leu-His-OH , or 



Gly-Leu-His-Y^ , wherein Y^ is -3 C-terminal extension 

sequence of 1-15 amino acids c?r responding to amino acids 

60-74 in Figure 3, and 

X is H or an amino acid sequence selected from 

the group consisting of H-AA^-.-, H-AA-.^-AA^-- , or X'-AA-.^- 
35 zb^/ 26 
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wherein : 

P^21 Ala, 
AA2g is lie or Ser, and 

X ' is H or an N-terminal extension sequence of 
1-25 amino acids corresponding to amino acids 1-25 in 
Figure 3 , 

or the pharmaceutically acceptable salts or 
amides thereof, with the proviso that if X is Phe-Gly-Ile- 
Pro, Y is His-Yj^ wherein Y^^ is a C-terminal extension of 
amino acids 60-66 and all Z are Val, AA2g-AA25 cannot be 
-Cys-Cys-Pro-Val-His-Leu-Lys-Arg- ; 

which comprises culturing the cells of claim 15 
and recovering the peptide. 

17. A purified polypeptide having ASP activity 
and defined by the sequence shown as amino acids 201 to 
275-281 in Figure 3- 



20 



25 



30 



35 
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FIG. 6 



HUMAN VS DOG SPi8 
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MAESHLLQWLLLLLPTLCGPGTAAWTTSSLACAQGPEFWCQSLEQALQCRALGHCLQEVW 



LL-W-LLLLPTLCGLGAADWSAPSLACA-^RGPAfWCQSLEQALQCRALGHCLQEVA^ 



70 80 90 LOO no 120 

GHVGADDLCQECEDIVHILNKMAKEAI FQDT/iRK FLEQECNVLPLKLLMPQCNQVLDDYF 



GNARADDLCQECQDIVRI LTKMTKEAI FQDM VR K F L EH ECD V L P L K L LT PQCH HM LGTY F 



130 140 iSC 160 170 180 

?LV I DYFQNQ I DSNG I CMHLGLCKSRQ?E?EQ£?GMSDP LP KPLRDPLPOPLLDKLVLPV 



PVVVDYFQSQINPKI ICKHLGLCKPGLPEPEQES ELSDPL LDKLI LPE 

T 
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LPGALQARPGPHTQDLSEQqrPI PLPVCWLCRALI KRIQAMI PKGALRVAVAQVCRVVPL 



LPGALQVT-GPHTQDLSEQqLPIPLPVCWLCRTL I K R I QAM ! F KG VLAVTVGQ VCH VVP L • 

25C 260 270 2slp 290 

VAGG:COCLA£?/:^SVr LLDTLLGRHu?C!-VC?XVLRCSriDDSAG?RS?TG--EWLPRDSE 

VVG3ICQCLGERVTVLLLDALLGRMLPQLVCGLVLRCSHEDSAGPALASLPSEWSPQESK 

I 
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CHLCr-iSVTTQAGNSSEQAI PQAHLQACVGSWLOREKCKQFVEQHTPOLLTLVPRGWDAHT 

CQLCHFVTTOAGNHSEQATPQAI RQACLSSWLDROKCEQFVEQHMPRLOTLASGGRDAHT 



370 380 
TCQALGVCGTMSSPLQCr HSPDL . 

TCQALGACRTTFSPLOCIHI PUF . 
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FIG, 



if 



MI2-Met Clu Lys Lys lie Thr Cly Tyr Thr 
ATC CX AAA AAA ATC ACT CGA TAT ACC 



His Phe Clu Ala Pr« Cln Ser Val AJa 
or TTT CAC CO^ TIT C>C TCA CTTT OCT 



lie Ttir Ma Phe Leu Lys Thr VaJ. Lys 
ATT ACC CCC TTT TTA AAC AOC CTA AAC 



liis lie Uiu Ala Arg U?u Met Aiji Ala 
CAC A . 1 CTT CCC OCC CTC ATC A^T CCT 



I 



^'iu "^^^ ^ -'^^ 

OC CTC ATA TCC CAT ACT CTT C^C 



I --^r Cor Uru Trp £-er Clu Tyr 

I , . . TCA TCC CTC TCC ACT CA.A. TAC 



Vsp val Ala C>-s T>x Cly Clu 
CAT CTC CCC TUT TAC CCT C%A 



■'■^ "'^ Pf"© 
A.^T A . _ ... TTC CTC TCA CCC A.VT CCC 



10 

Thr VaX Asp 
ACC CTT CAT 

30 

Cln Cys Thr 
CAA TCT ACC 

SO 

Lys Asn Lys 
AAA AAT AAC 

70 

Mis Pro Clu 
CAT CCC CK\ 

90 

Pro Cys Tyr 
CCT TCT TAC 

110 

His .KSp Asp 
CAC CAC CAT 

130 

Asn Leu Ala 
A.AC CTC CCC 

150 

Trp vai sor 
TCC CTC ACT 



lie Ser Cln 
ATA TOC CAA 



Tyr Asn Cln 
T^T AAC CAC 



Trp 

TCC 



His Arg 
CAT OCT 



Thr 
ACC 



His Lys Phe 
&C AAC TTT 



FN2 Arq 
TTC CCT ATC 



Tyr 
TAT 



CO\ 



val Prv^ 



?N? Arg Cln 

rrc CCC cac 



T\-r Phc Pro 
TAT TTC CCT 



TC ACC A^ I 



His 
CaT 



Prv? 



Val Cln 
CTT CAC 



Pro Ala 
CCC CCC 



nrt Lys 
ATC K^Jk 



Clu CLn 

Gv: c^^ 



L/.-J His 
CZ.\ CAC 



Ala A^. v.c:. Asp Asn P^^ ptv; Ala Pro 
V-.: ATC CAC A.AC TTC TTC CCC CCC 

•--^ L.2U Kct Pro Leu Ala lie 

C\C A^- CTC CTC ATC CCC CTC CCC ATT 



•'al Cly i^g Met Leu Asi Clu Leu Cln 
CTC CrC AZA ATC CTT AAT CAA TTA CAA 



_ 220 

•Arg Ser Scr Cys Phe Cly Cly Arg Met 
CCC iw. TCT TCT TTC CCT CCT CCT ATC 

240 



170 

Val PJv! T>j- 
CTT TTC ACC 



190 
CLn 
CAC 



.'al H:s 
TTT C^T 



."-01 Cly Ly- 
ATC CCC A.U 



r.iS AUi Val 



Cly Phe 
CCC TTT 



Asp L'ju 
C'.T TTA 



TjT Tr_r 
.AT A.CC 



^JTp Cly 
0\T CCC 



20 

Lrs Clu 
AAA CAA 

o 

Leu Asp 
CTC GAT 

60 

Phe lie 
TTT ATT 

GO 

Asp Cly 
C\C CCT 

:co 

Thr Clu 
ACT G5».\ 

i:o 

lie T". r 
ATA TAT 

:':0 
He Clu 

ATT c-r; 

1 60 
A^n v^i. 
AAC CTC 

IcO 
Cln Cl^- 
C-.A CC-C 

r-op 

Phc hi IS 
TTC CAT 



210 

Cln S.iT Clu 
CAC TCC CAT 



Asp A_-r: 
CAT CCT 



?fo Clu Phi 
CCC CAA TTC 

•~\ f Aj a C In 
~ CCT C.KK 



Clu 
CAA 



Sor Cljr Leu Cly Cys 
TCT CCT TTC CCT TCT 



Pfxi Arg Tyr-ODOH 
AAC TCT TTC ACA TAC 
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100 

CTAGTTAACTAGTACGCAAGTTCACGTAAAAAGGGTATCACAT ATG GAG AAA AAA ATC ACT GGA 

MET Glu Lys Lys lie Thr Gly 



TAT ACC ACC GTT GAT ATA TCC CAA TGG CAT CGT AAA GAA CAT TTT GAG GCA TTT 

Tyc Thr Thr val Asp lie Set Gin Trp His Arg Lys Glu His Phe Glu Ala Phe 

200 

CAG TCA GTT GCT CAA TGT ACC TAT AAC CAG ACC GTT CAG CTG GAT ATT ACG GCC 

Gin Ser Val Ala Gin Cys Thr Tyr Asn Gin Thr Val Gin Leu Asp lie Thr Ala 



TTT TTA AAG ACC GTA AAG AAA AAT AAG CAC AAG TTT TAT CCG GCC TTT ATT CAC 
Phe Leu Lys Thr Val Lys Lys Asn Lys His Lys Phe Tyr Pro Ala Phe lie His 

300 

ATT CTT GCC CGC CTG ATG AAT GCT CAT CCG GAA TTC CGT ATG GCA ATG AAA GAr-- 
lie Leu Ala Arg Leu^MET Asn Ala His Pro Glu Phe Arg MET Ala MET Lys Ast..- 

GGT GAG CTG GTG ATA TGG GAT AGT GTT CAC CCT TGT TAC ACC GTT TTC CAT GAG 
Gly Glu Leu Val He Trp Asp Ser Val His Pro Cys Tyr Thr Val Phe His Glu 

400 

CAA ACT GAA ACG TTT TCA TCG CTC TGG AGT GAA TAC CAC GAC GAT TTC CGG CAG 
Gin Thr Glu Thr Phe Se.r Ser Leu Trp Ser Glu Tyr His Asp Asp Phe Arg Gin 

TTT CTA CAC ATA TAT TCG CAA GAT GTG GCG TGT TAC GGT GAA AAC CTG GCC TAT 
Phe Leu His lie Tyr Ser Gin Asp Val Ala Cys Tyr Gly Glu Asn Leu Ala Tyr 

500 

TTC CCT AAA GGG TTT ATT GAG AAT ATG TTT TTC GTC TCA GCC AAT CCC TGG GTG 
Phe Pro Lys Gly Phe lie Glu Asn MET Phe Phe Val Ser Ala Asri Pro Trp Val 

AGT TTC ACC AGT TTT GAT TTA AAC GTG GCC AAT ATG GAC AAC TTC TTC GCC CCe- 
Ser Phe Thr Ser Phe Asp Leu Asn Val Ala Asn MET Asp Asn Phe Phe Ala Pro 

600 

GTT TTC ACC ATG GGC AAA TAT TAT ACG CAA GGC GAC AAG GTG CTG ATG .CCG CTG 
Val Phe Thr MET Gly Lys Tyr Tyr Thr Gin Gly Asp Lys Val Leu MET Pro Leu 

700 

GCG ATT CAG GTT CAT CAT GCC GTT TGT GAT GGC TTC CAT GTC GGC AGA ATG CTT 
Ala He Gin Val His His Ala Val Cys Asp Gly Phe His Val Gly Arg MET Leu 



CAT 

AAT GAA TTA CAA CAG 
Asn Glu Leu Gin Gin 



linker 
TCG GAT CCG GAA TTC AAC 
Serr Asp Pro Glu Phe Asn 



SP-C 

GGC ATT CCC TGC TGC CCA GTG 
Gly lie Pro Cys Cys Pro Val 

800 

CAC CTG AAA CGC CTT CTT ATC GTG GTG GTG GTG GTG GTC CTC ATC GTC GTG GTG 
His Leu Lys Arg Leu Leu lie Val Val Val Val Val Val Leu lie Val Val val 



848 

ATT GTG GGA GCC CTG CTC ATG GGT CTC CAC TAA GCT T 
He Val Gly Ala Leu Leu MET Gly Leu His End 
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pCl<9SP-C (^6<iArc<-| t CAATTCAAATATTCTCAAATOACCTCTTCACAATTAJ^TCATCGAA 

100 

CTACTTAACTACTACCCAAOTTCACGTAAAAACaCTATCACAT ATO CAC AAA AAA ATC ACT CGA 

KST Clu Lye Lye 11* The GLy 



TAT ACC ACC GTT GAT ATA TCC CAA TCC CAT CGT AAA GAA CAT TTT GAG GCA TTT 

Tyr Thr Thr Val Aap lie S«r Gin Trp Hlc Arg Lyc Clu Hlc Phm Glu Ala Phe 

200 

CAG TCA GTT GCT CAA TGT ACC TAT AAC CAG ACC GTT CAG CTG GAT ATT ACG GCC 

Gin Ser Vdl Ala Gin Cys Thr Tyr Acn Gin Thr Val Cln Lou Acp Il« Thr Ai& 

TTT TTA AAG ACC GTA AAG AAA AAT AAG CAC AAC TTT TAT CCG GCC TTT ATT CAC 

Phe Leu Lys Thr Val Lys Lya Asn Lyc Hia Lyc 7hs Tyr Pro Ala Phe lie Hlc 

3D0 

ATT CTT GCC CGC CTG ATG AAT GCT CAT CCG GAA TTC CCT ATG GCA ATG AAA GAC 

lie Leu Ala Arg Leu HET Acn Ala Hie Pro Clu Fhe Arg rt2T Ala xMST LyG Ago 

GGT GAG CTG GTG ATA TGG GAT AGT CTT CAC CCT TGT TAG ACC CTT TTC CAT GAG 

Gly Glu Leu Val lie Trp Asp Ser Val His pro Cyc Tyr Thr Val Phe Klc Clu 

CAA ACT GAA ACG TTT TCA TCG CTC TGG A3T OKk TAC CAC GAC CAT TTC CCG CAC 

Gin Thr Glu Thr Phe Ser Sec Leu Trp Sec clu Tyr Hie Acp Asp Phe Arg Cln 



0 



Phe 



TTC 
P he 

i 

AAC 
Asn 



CTA CAC ATA TAT TCG CAA CAT CTG CCG TGT TAC CCT 

Leu His rle Tyr Ser Gin Acp Val Aia Cyc Tyr Gly 

500 

CCT .\AA GGG TTT ATT GAG AAT ATG TTT TTC GTC TCA 

Pro Lys Gly Phe lie Glu Acn met ?ho Phfi* val Ser 



- ccg 

;ggc 

jCly 



taa ggg agg 99t cac gtg-S' 

ATT CCC TGC TGC CCA GTG CAC CTG AAJV CGC CTT 
Ii.e Pro Cy8 Cys Pro Val Hie :,au Lyc Arg Leu 



CTG 
Val 

f 



600 

GTG GTC CTC ATC GTC GTG GTG ATT GTG CGA GCC CTG 
Val Val Leu lie Val Val Val lie val Gly Ala Leu 

653- 
GCT T/ 



GAA AAC CTG GCC TAT 
Glu Acn Ldu Ala Tyr 



GCC AAT CCC /GAA TTC 
Ala Acn Pro/ Glu ?he 



CTT ATC GTG GTC CTC 
L<5U lie Val Val Val 



CTC ATG GGT CTC CAC 
Leu MET Gly Leu His 
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